6/14/2018
Get text of interview: xml2 package (read.xml), rvest
Pull author names from text: Regular expressions (!!!) - stringr
str_extract_all(string = btblines[x],pattern = "((?<![“]) ([:upper:]{1}(\\. )?)+[:lower:]+(?=([ \\’\\'-][:upper:]{1} (\\. )?)+)(?:[\\s\\’\\'-][:upper:]{1}(\\. )?[[:upper:]{1} ([:lower:]\\'+)-]+)+)")
Connect to goodreads & wikipedia APIs for gender & birthdate: xml2, WikipediR, tidytext
Analyze: dplyr, ggplot
Top Authors Mentioned Together
George Saunders & Lin-Manuel Miranda